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Kosakonia sacchari sp. nov. is a new species within the new genus Kosakonia, which was 
included in the genus Enterobacter. K. saccfiari is a nitrogen-fixing bacterium named for its 
association with sugarcane {Saccliarum officinarum L.). K. sacciiari bacteria are Gram- 
negative, aerobic, non-spore-forming, motile rods. Strain SPl^ (=CGMCC1 .12102^=LMG 
26783^) is the type strain of the _K. sacchari sp. nov and is able to colonize and fix N2 in 
association with sugarcane plants, thus promoting plant growth. Here we summarize the 
features of strain SPl^ and describe its complete genome sequence. The genome contains a 
single chromosome and no plasmids, 4,902,024 nucleotides with 53.7% GC content, 4,460 
protein-coding genes and 105 RNA genes including 22 rRNA genes, 82 tRNA genes, and 1 
ncRNA gene. 



Introduction 

The genus Enterobacter belonging to the family 
Enterobacteriaceae is polyphyletic based on 16S 
rRNA gene sequence analysis [1-3]. Recently, 
eleven species belonging to the genus Enterobacter 
were transferred into the genus Cronobacter and 
three novel genera [Lelliottia, Pluralibacter, and 
Kosakonia) based on multilocus sequence analysis 
of protein-coding genes, rpoB [RNA polymerase (B- 
subunit gene), gyrB [DNA gyrase subunit B gene), 
infB [initiation translation factor 2 gene), and atpD 
[ATP synthase (B-subunit gene) [1]. Enterobacter 
cowanii, E. radicincitans, E. oryzae and E. arachidis 
were reclassified as Kosakonia cowanii, K. 
radicincitans, K. oryzae and K. arachidis, 
respectively [1]. Enterobacter sacchari is a new 
species named for nitrogen-fixing bacteria in 
association with sugarcane [Saccharum 
officinarum L.) [2,4] and has been reclassified as 
Kosakonia sacchari [3]. K. sacchari is able to 
colonize sugarcane plants, fix N2 in association 
with sugarcane plants and promote plant growth 
[4]. K. sacchari strain SPIT was isolated from a 
surface-sterilized stem of sugarcane cultivar GTll 
grown in Nanning, Guangxi, China in 1994. It has 



now been designated the type strain of K. sacchari 
sp. nov [2,3]. Here we present a summary of its 
features [2] and the complete genome sequence 
and annotation for _K. sacchari strain SPIT 
[=CGMCC1.12102T=LMG 26783T). 

Organism information 

Classification and general features 

K. sacchari type strain SPIT is a Gram-negative, 
non-spore-forming, motile rod with peritrichous 
flagella [Figure 1., Table l.[2]). It grows aerobically 
but reduces N2 to NH3 at a low p02. It is able to 
grow and fix N2 on media containing 10% [w/v) 
cane sugar or sucrose and forms circular, convex, 
smooth colonies with entire margins on solid 
media. It grows best around 30°C and pH 7. 
Phylogenetic analysis of the 16S rRNA gene 
sequences from SPIT, the type strains of species of 
the genus Enterobacter and the type strains of type 
species of other genera in the family 
Enterobacteriaceae showed that SPIT formed a 
monophyletic group with the type strain of E. 
cloacae [the type species of the genus 
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Enterobacter) [2]. However, phylogenetic analysis 
of the rpoB gene sequences showed that SPIT 
diverged from E. cloacae [2]. Here, phylogenetic 
analysis of the 16S rRNA gene sequences from 
SPIT, other type strains in the genus Kosakonia, 
Like typical members in the genera Enterobacter 
and Kosakonia, K. sacchari SPIT utilizes L-alanine, 
D-cellobiose, citrate, D-fructose, D-galactose, D- 
glucose, glycerol, maltose, D-mannitol and D- 
mannose [2,6,7]. K. sacchari differentiates from E. 
cloacae by utilization of D-arabitol and L-fucose, 



and the type strain of E. cloacae showed that K. 
sacchari formed a monophyletic group with K. 
radicincitans, K. oryzae, and K. arachidis and 
diverged from K. cowanii [the type species of the 
genus Kosakonia) and E. cloacae [Figure 2.). 
differentiates from K. radicincitans by utilization of 
putrescine, D-arabitol, L-fucose and a-methyl-D- 
glucoside, and differentiates from K. oryzae by 
utilization of putrescine, D-arabitol and L- 
rhamnose [2]. 




Figure 1. Transmission electron micrograph showing a negative-stained 
cell of the Kosakonia sacchari type strain SPIT [2]. The scale bar 
represents 1 \im. 
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Figure 2. Phylogenetic tree based on 16S rRNA gene sequences of Kosakonia sacchari type strain SPIT (□), the type 
strains of other species in the genus Kosakonia, and the type strain of Enterobacter cloacae. The sequences were 
aligned with the CLUSTAL W program and were constructed with the neighbor-joining algorithm integrated in the 
MEGA 5.0 program [5]. The phylogenetic tree was tested with 1,000 bootstrap replicates. Bootstrap values are shown at 
the nodes. The GenBank accession numbers of the sequences are indicated in parentheses. The scale bar represents a 
0.1% nucleotide sequence divergence. 
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Table 1 . Classification and general features of Kosakonia sacchari type strain SP1 T according to 
the MIGS recommendations 



MIGS ID 



Property 



Current 
classification 



Term 



Domain Bacteria 
Phylum Proteobacteria 
Class Gammaproteobacteria 
Order Enterobacteriales 
Family Entembacteriaceae 
Genus Kosakonia 
Species Kosakonia sacchari 
Type strain: SPI^ 



Evidence code 



TAS [16] 
TAS [17] 
TAS [18-20] 
TAS [21] 
TAS [22,23] 
TAS [1,3] 
TAS [2,3] 
TAS [2,3] 



MIGS-6 
MIGS-6.3 
MlGS-22 
MIGS-23 

MIGS-15 

MIGS-14 
MIGS-4 

MIGS-5 

MIGS-4.1 
MIGS-4.2 

MIGS-4.3 
MIGS-4.4 



Gram stain 

Cell shape 

Motility 

Sporulation 

Temperature 

range 

Optimum 

temperature 

Carbon source 



Energy source 

Habitat 

Salinity 

Oxygen 

Isolation 

Biotic 

relationship 
Pathogenicity 

Geographic 
location 

Sample collection 
time 

Longitude 
Latitude 

Depth 
Altitude 



Negative TAS [2] 

Rod TAS [2] 

Motile TAS [2] 

Non-sporulating TAS [2] 

Mesophile TAS [2] 

28 - 32°C TAS [2] 

Sucrose, glucose, fructose, TAS [2] 
galactose, 

maltose, mannitol, mannose, 
arabitol 

Chemoorganotroph TAS [2] 

Soil, plants IDA 

0 - 4% NaCI TAS [2] 

Aerobe TAS [2] 

Stem of sugarcane cultivar GT1 1 TAS [2] 

Free-living, endophytic IDA 
Not reported 

Nanning, Guangxi, China TAS [2] 

1 994 TAS [2] 

108.33 NAS 

22.84 NAS 

0.1 - 0.5 m above the surface IDA 

76 m NAS 



Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a 
direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly 
observed for the living, isolated sample, but based on a generally accepted property for the 

species, or anecdotal evidence) 

Genome sequencing information 
Genome project history 



K. sacchari SPIt was selected for sequencing 
because it is the type strain of K. sacchari, and on 
the basis of its scientific interest as an endophyte 
that has the potential to promote the growth of 
agriculturally important crops by nitrogen fixation 
[8]. Its 16S rRNA gene sequence is deposited in 



GenBank under the accession number JQ001784. 
Its genome sequence is deposited in GenBank 
under the accession number CP007215.2. A 
summary of the genome sequencing project 
information and its association with MIGS version 
2.0 compliance is shown in Table 2. 
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Table 2. Genome sequencing project information for Kosakonia sacchari type strain SPI^ 
M'GS ID Property Term 



MIGS-31 


Finishing quality 


Finished 


MIGS-28 


Libraries used 


Pacbio 4 - 1 0 Kb library 


MIGS-29 


Sequencing platforms 


PacBio RS II 


MIGS-31. 2 


Fold coverage 


63 X 


MIGS-30 


Assemblers 


HGAP in smrtanalysis-2.1 .1 


MIGS-32 


Gene calling method 


GeneMarkS+ 




Genome Database release 


Genbank 




Genbank ID 


CP007215.2 




Genbank Date of Release 


May 23, 2014 




Project relevance 


Taxonomy, biotechnology 



Growth conditions and DNA isolation 

K. sacchari SPIt was grown in liquid Luria-Bertani 
[LB) medium at 30°C to early stationary phase. 
The genome DNA was extracted from the cells by 
using a TIANamp bacterial DNA kit (Tiangen 
Biotech, Beijing, China). DNA quality and quantity 
were determined with a Nanodrop spectrometer 
(Thermo Scientific, Wilmington, USA). 

Genome sequencing and assembly 

The genome DNA of K. sacchari strain SPIt was 
first constructed into a 500-bp-insert library and 
sequenced by an Illumina HiSeq 2000 sequencing 
system. A draft genome of 4,945,084 nucleotides 
containing 239 contigs was obtained and 
deposited at DDBJ/EMBL/GenBank under the 
accession no. AMSCOOOOOOOO [8]. However, 
84,628 nucleotides [203 short contigs) of the draft 
genome were accidently contaminated by 
sequences from eukaryotic organisms. Therefore, 
the genome of SPl^ was resequenced at the Duke 
University Genome Sequencing & Analysis Core 
Resource using the Pacific Biosciences' Single 
Molecule, Real-Time (SMRT) sequencing 
technology [ http : / / www.pacificbiosciences.com / ). 
A 4 - 10 Kb insert library was constructed. 
Sequencing was run on a single SMRT Cell. The 
sequencing data were assembled using the 
Hierarchical Genome Assembly Process [HGAP) 
with smrtanalysis-2.1.1. The final assembly of the 
chromosome produced 63-fold coverage of the 
genome. 



Genome annotation 

Automated genome annotation was completed 
using the NCBI Prokaryotic Genome Annotation 
Pipeline. Product description annotations were 
obtained using searches against the KEGG, 
InterPro, and COG databases. Genes with signal 
peptides were predicted using SignalP [9]. Genes 
with transmembrane helices were predicted using 
TMHMM [10]. Genes for tRNA were found by 
tRNAScanSE [11]. Ribosomal RNAs were found by 
using BLASTN vs. ribosomal RNA databases, and 
5S rRNA hits were further refined using Cmsearch 
[ http://manpages.ubuntu.com/manpages/rari 
ng/manl/cmsearch.l.html ). Two hundred 
twenty seven disrupted genes were replaced by 
the complete gene sequences obtained from the 
first Illumina HiSeq 2000 sequencing. 

Genome properties 

The genome of K. sacchari SPIt contains a single 
chromosome of 4,902,024 nucleotides with 53.7% 
GC content and no plasmids [Table 3, Figure 3.). 
The genome contains 4,585 predicted genes, 4,460 
protein-coding genes and 105 RNA genes including 
22 rRNA genes, 82 tRNA genes and 1 ncRNA gene. 
A total of 3,752 genes [81.8%) were assigned a 
putative function. The remaining genes were 
annotated as hypothetical or unknown proteins 
[Table 3). The distribution of genes into COGs 
functional categories is presented in Table 4 
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Attribute 


Value 


% of total 


Size (bp) 


4,902,024 


100.00 
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Q7 9 7 


Pseudo genes 




U.44 


Genes assigned to COGs 


3,786 


82.57 


Genes with signal peptides 


452 


9.86 


Genes with transmembrane helices 


1096 


23.90 




Figure 3. Graphical circular map of the chromosome of Kosakonia sacchari type strain SPIT. From outside to 
the center: Genes on forward strand (color by SEED subsystems [12]), Genes on reverse strand (color by SEED 
subsystems), genome structure (a circular chromosome with no gaps), GC content, GC skew. 
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Table 4. Number of genes associated with the 25 general COG functional categories 
Code Value %oftotal^ Description 



J 


193 


4.33 


Translation 


A 


2 


0.04 


RNA processing and modification 


K 


386 


8.65 


Transcription 


L 


170 


3.81 


Replication, recombination and repair 


B 


0 


0.00 


Chromatin structure and dynamics 


D 


38 


0.85 


Cell cycle control, mitosis and meiosis 


Y 


0 


0.00 


Nuclear structure 


V 


52 


1.17 


Defense mechanisms 


T 


269 


6.03 


Signal transduction mechanisms 


M 


251 


5.63 


Cell wall/membrane biogenesis 


N 


128 


2.87 


Cell motility 


Z 


0 


0.00 


Cytoskeleton 


W 


0 


0.00 


Extracellular structures 


u 


107 


2.40 


Intracellular trafficking and secretion 


o 


144 


3.23 


Posttranslational modification, protein turnover, 
chaperones 


c 


268 


6.01 


Energy production and conversion 


G 


394 


8.83 


Carbohydrate transport and metabolism 


E 


414 


9.28 


Amino acid transport and metabolism 


F 


90 


2.02 


Nucleotide transport and metabolism 


H 


186 


4.17 


Coenzyme transport and metabolism 


1 


117 


2.62 


Lipid transport and metabolism 


P 


265 


5.94 


Inorganic ion transport and metabolism 


Q 


82 


1.84 


Secondary metabolites biosynthesis, transport and 


catabolism 


R 


481 


10.78 


General function prediction only 


S 


382 


8.57 


Function unknown 




674 


15.11 


Not in COGs 



a) The total is based on the total number of protein coding genes in the annotated genome. 

Insights from the genome. 

Comparison with the genome of Enterobacter sp. strain R4-368 

The chromosome of K. sacchari SPV shows the strain R4-368. The digital DNA-DNA 

highest sequence similarities ranging from 69.5% hybridization values between the two 

to 100% to the chromosome of Enterobacter sp. chromosomes calculated by the onUne 

strain R4-368, which is an endophytic nitrogen- Genome-to-Genome Distance Calculator 

fixing bacterium isolated from the biofuel plant .^5] (version 2.0; http://ggdc.dsmz.de ) are 

Jatropha curcas [13]. The genome of the strain R4- 20/0, 57.7%, and 86.6% under the distance 

368 comprises a single circular chromosome of „ ^ ^ r j j r j i- -^i 

5,039,027 bp with 54.0% GC content (deposited in ^^^^^^^f ^' ^ (recommended for dealmg with 

GenBank under the accession number CP005991) incomplete genomes), and 3, respectively. The 

and one plasmid pENTOl of 116,007 bp with probabilities of same species for the two 

52.8% GC content (deposited in GenBank under strains (DDH > 70%) assessed via logistic 

theaccessionnumber CP005992) [13]. regression are 97.4%, 44.3%, and 98.8%, 

The chromosome of K. sacchari SPIT shares respectively. Likely, strain R4-368 belongs to 

4,105 genes (89.5%) with the chromosome of ^he species K. sacchari. 
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